Extract Concept using Subtitles in MOOC
نویسندگان
چکیده
Massive open online courses (MOOCs) are a variety of offered through the mode, paid or unpaid and has evolved as an excellent learning resource for students. The structure course design is mainly linear where there few video lectures provided by either professors several universities, people with expertise in particular subject. They usually graded on weekly basis quizzes peer-graded assignments. objective this paper to extract concepts taught videos from subtitles, which could later be used enhance recommendations learners using their clickstream data. teachers also use see demand courses. Evaluate two keyword extraction methods, BERT LDA different Coursera experimental results show that outperforms terms Coherence.
منابع مشابه
Semantic Video Indexing and Summarization Using Subtitles
How to build semantic index for multimedia data is an important and challenging problem for multimedia information systems. In this paper, we present a novel approach to build a semantic video index for digital videos by analyzing the subtitle files of DVD/DivX videos. The proposed approach for building semantic video index consists of 3 stages, viz., script extraction, script partition and scr...
متن کاملUsing Formal Concept Analysis to Extract a Greatest Common Model
Data integration and knowledge capitalization combine data and information coming from different data sources designed by different experts having different purposes. In this paper, we propose to assist the underlying model merging activity. For close models made by experts of various specialities, we partially automate the identification of a Greatest Common Model (GCM) which is composed of th...
متن کاملUsing Linguistic Annotations in Statistical Machine Translation of Film Subtitles
Statistical Machine Translation (SMT) has been successfully employed to support translation of film subtitles. We explore the integration of Constraint Grammar corpus annotations into a Swedish–Danish subtitle SMT system in the framework of factored SMT. While the usefulness of the annotations is limited with large amounts of parallel data, we show that linguistic annotations can increase the g...
متن کاملUnderstanding MOOC Discussion Forums using Seeded LDA
Discussion forums serve as a platform for student discussions in massive open online courses (MOOCs). Analyzing content in these forums can uncover useful information for improving student retention and help in initiating instructor intervention. In this work, we explore the use of topic models, particularly seeded topic models toward this goal. We demonstrate that features derived from topic a...
متن کاملUsing Movie Subtitles for Creating a Large-Scale Bilingual Corpora
This paper presents a method for compiling a large-scale bilingual corpus from a database of movie subtitles. To create the corpus, we propose an algorithm based on Gale and Church’s sentence alignment algorithm(1993). However, our algorithm not only relies on character length information, but also uses subtitle-timing information, which is encoded in the subtitle files. Timing is highly correl...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: International Journal of Advanced Computer Science and Applications
سال: 2022
ISSN: ['2158-107X', '2156-5570']
DOI: https://doi.org/10.14569/ijacsa.2022.0130176